Variational image compression with a scale hyperprior

نویسندگان

  • Johannes Ballé
  • David Minnen
  • Saurabh Singh
  • Sung Jin Hwang
  • Nick Johnston
چکیده

We describe an end-to-end trainable model for image compression based on variational autoencoders. The model incorporates a hyperprior to effectively capture spatial dependencies in the latent representation. This hyperprior relates to side information, a concept universal to virtually all modern image codecs, but largely unexplored in image compression using artificial neural networks (ANNs). Unlike existing autoencoder compression methods, our model trains a complex prior jointly with the underlying autoencoder. We demonstrate that this model leads to state-of-the-art image compression when measuring visual quality using the popular MS-SSIM index, and yields rate–distortion performance surpassing published ANN-based methods when evaluated using a more traditional metric based on squared error (PSNR). Furthermore, we provide a qualitative comparison of models trained for different distortion metrics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Variational Models (Appendix)

Relationship to empirical Bayes and RL. The augmentation with a variational prior has strong ties to empirical Bayesian methods, which use data to estimate hyperparameters of a prior distribution (Robbins, 1964; Efron & Morris, 1973). In general, empirical Bayes considers the fully Bayesian treatment of a hyperprior on the original prior—here, the variational prior on the original meanfield—and...

متن کامل

Variational posterior distribution approximation in Bayesian super resolution reconstruction of multispectral images

In this paper we present a super resolution Bayesian methodology for pansharpening of multispectral images. By following the hierarchical Bayesian framework, and by applying variational methods to approximate probability distributions this methodology is able to: (a) incorporate prior knowledge on the expected characteristics of the multispectral images, (b) use the sensor characteristics to mo...

متن کامل

The effect of Combined Decongestive Therapy and pneumatic compression pump on body image in women with breast cancer related lymphedema

Introduction: Patients with breast cancer who have two positive axillary lymph nodes, along with mastectomy, they undergo axillary node dissection. Lymphedema after axillary surgery is a feared complication. This women experience pain and body image impairments. Any intervention to reduce lymphedema, affects the body image of these patients. Method: This study is a randomized, single-blind clin...

متن کامل

A Novel Color Image Compression Method Using Eigenimages

Since the birth of multi–spectral imaging techniques, there has been a tendency to consider and process this new type of data as a set of parallel gray–scale images, instead of an ensemble of an n–D realization. Although, even now, some researchers make the same assumption, it is proved that using vector geometries leads to better results. In this paper, first a method is prop...

متن کامل

Hierarchical Bayesian estimates of distributed MEG sources: theoretical aspects and comparison of variational and MCMC methods.

Magnetoencephalography (MEG) provides millisecond-scale temporal resolution for noninvasive mapping of human brain functions, but the problem of reconstructing the underlying source currents from the extracranial data has no unique solution. Several distributed source estimation methods based on different prior assumptions have been suggested for the resolution of this inverse problem. Recently...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1802.01436  شماره 

صفحات  -

تاریخ انتشار 2018